[Feature] Add RLPD Baseline #437

StoneT2000 · 2024-07-17T23:27:51Z

RLPD is a SOTA sample-efficient online imitation learning method (without using state resets).

StoneT2000 · 2024-07-17T23:58:42Z

Note that the recorded metrics and config are not yet aligned with the other baselines.

StoneT2000 added 6 commits July 17, 2024 16:26

work

807e1c3

Update README.md

113f8e0

work

88e5816

docs

f6abef5

Update index.md

be19922

Update README.md

59870a1

StoneT2000 merged commit 43e3699 into main Jul 17, 2024

StoneT2000 deleted the rlpd-baseline branch July 17, 2024 23:58
